Fast Similarity Search in Three-Dimensional Structure Databases

Authors

  • Xiong Wang
  • Jason Tsong-Li Wang
Abstract

Given a database D of three-dimensional (3D) molecular structures and a target molecule Q, the similarity search problem is to find the molecules O in D that match Q after allowing for an arbitrary number of whole-structure rotations and translations as well as a certain number of edit operations. The edit operations are relabeling an atom, deleting an atom, and inserting an atom. This search operation arises in many biochemical applications. In this paper we study the similarity search problem and a class of related queries. We present a technique based on computer vision, called geometric hashing, for processing these queries. Experimental results on a database of 3D molecular structures obtained from the National Cancer Institute indicate that the presented technique performs well.
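As a rough illustration of the idea behind geometric hashing, and not the authors' implementation, the sketch below indexes each molecule by keys that do not change under whole-structure rotations and translations. It assumes a simplified scheme in which each key is built from the atom labels and quantized pairwise distances of an atom triplet. The function names `triplet_keys`, `build_index`, and `query`, and the parameter `bin_width`, are hypothetical and chosen only for this example.

```python
from collections import defaultdict
from itertools import combinations
import math


def triplet_keys(atoms, bin_width=0.5):
    """Return rotation- and translation-invariant keys for one molecule.

    atoms: list of (label, (x, y, z)) tuples.
    Each key describes one atom triplet by its three edges, where an edge
    is (sorted label pair, quantized distance). Distances are preserved by
    rigid motions, so the keys are too (up to quantization effects).
    """
    keys = set()
    for triple in combinations(atoms, 3):
        edges = []
        for (label_a, pos_a), (label_b, pos_b) in combinations(triple, 2):
            dist = math.dist(pos_a, pos_b)
            edges.append((tuple(sorted((label_a, label_b))),
                          round(dist / bin_width)))
        # Sorting the edges makes the key independent of atom order.
        keys.add(tuple(sorted(edges)))
    return keys


def build_index(database, bin_width=0.5):
    """Map every key to the set of molecule ids that contain it."""
    index = defaultdict(set)
    for mol_id, atoms in database.items():
        for key in triplet_keys(atoms, bin_width):
            index[key].add(mol_id)
    return index


def query(index, atoms, bin_width=0.5):
    """Rank database molecules by the number of keys shared with the target."""
    votes = defaultdict(int)
    for key in triplet_keys(atoms, bin_width):
        for mol_id in index.get(key, ()):
            votes[mol_id] += 1
    return sorted(votes.items(), key=lambda item: -item[1])
```

Because matching is done by voting, a target that differs from a stored molecule by a few relabeled, deleted, or inserted atoms still shares some keys with it and can be ranked by its vote count; how the paper itself scores and thresholds such partial matches is not stated in the abstract.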


Similar resources

Ultrafast shape recognition for similarity search in molecular databases

Molecular databases are routinely screened for compounds that most closely resemble a molecule of known biological activity to provide novel drug leads. It is widely believed that three-dimensional molecular shape is the most discriminating pattern for biological activity as it is directly related to the steep repulsive part of the interaction potential between the drug-like molecule and its ma...


MLR-Index: An Index Structure for Fast and Scalable Similarity Search in High Dimensions

High-dimensional indexing has been widely used for similarity search over various data types such as multimedia (audio/image/video) databases, document collections, time-series data, sensor data and scientific databases. Because of the curse of dimensionality, it is already known that common data structures such as the kd-tree, R-tree, and M-tree suffer in their performance over...


Fast similarity search on video signatures

Video signatures are compact representations of video sequences designed for efficient similarity measurement. In this paper, we propose a feature extraction technique to support fast similarity search on large databases of video signatures. Our proposed technique transforms the high dimensional video signatures into low dimensional vectors where similarity search can be efficiently performed. ...


Utilization of Principle Axis Analysis for Fast Nearest Neighbor Searches in High-Dimensional Image Databases

This paper presents an efficient indexing method for similarity searches in high-dimensional image databases using principal axis analysis. Image databases often represent image objects as high-dimensional feature vectors and access them via the feature vectors and a similarity measure. However, the performance of the existing nearest neighbor search methods is far from satisfactory for feature ve...


Fast indexing: a comparative evaluation

In this evaluation, the efficiency of three image signatures (angular spectrum, a Hough-based signature, and color histogram) is tested. The first signature is intrinsically hierarchical (deriving from the image frequency spectrum), and therefore no signature space reduction technique is used. The second is a short signature that is indexed directly, and the last (color histogram) needs to be reduced for fast indexin...



Journal title:
  • Journal of chemical information and computer sciences

Volume 40, Issue 2

Pages: -

Publication date: 2000